Wikispeech - a content management system for speech databases
نویسندگان
چکیده
In this paper we describe WikiSpeech, a content management system for the web-based creation of speech databases for the development of spoken language technology and basic research. Its main features are full support for the typical recording, annotation and project administration workflow, easy editing of the speech content, plus a fully localizable user interface. For the creation of a new speech database, it is only necessary to open a new project within WikiSpeech, provide a link to any static project information pages and upload the prompt material to be presented to the speakers. Recordings and annotation are performed via the WWW in a platform independent manner on any Java compatible computer. WikiSpeech currently has been localized to four languages: German, English, Romanian and Russian, and it is now used for production recordings at the Bavarian Archive for Speech Signals in Munich, Germany.
منابع مشابه
Speech recordings via the internet: an overview of the VOYS project in scotland
The VOYS (Voices of Young Scots) project aims to establish a speech database of adolescent Scottish speakers. This database will serve for speech recognition technology and sociophonetic research. 300 pupils will ultimately be recorded at secondary schools in 10 locations in Scotland. Recordings are performed via the Internet using two microphones (closetalk and desktop) in 22,05 kHz 16 bit lin...
متن کاملWikiSpeech - enabling open source text-to-speech for Wikipedia
We present WikiSpeech, an ambitious joint project aiming to (1) make open source text-to-speech available through Wikimedia Foundation’s server architecture; (2) utilize the large and active Wikipedia user base to achieve continuously improving text-to-speech; (3) improve existing and develop new crowdsourcing methods for text-to-speech; and (4) develop new and adapt current evaluation methods ...
متن کاملDeveloping a Standardized Medical Speech Recognition Database for Reconstructive Hand Surgery
Fast and holistic access to the patients’ clinical record is a major requirement of modern medical decision support systems (DSS). While electronic health records (EHRs) have replaced the traditional paper-based records in most healthcare organization, the data entry into these systems remains largely manual. Speech recognition technology promises substitution of the more convenient speech-base...
متن کاملمراحل و نحوه ی تهیه ی دادگان های صوتی هجایی و دایفونی برای سامانه ی تبدیل متن به گفتار فارسی
Abstract Speech databases are part of the concatenative text to speech synthesis systems. Phonetic quality of the databases plays a significant role in the naturalness of the synthesized speech. This paper introduces two syllable and diphone speech databases for Persian and investigates the way of their development and their specifications and their advantages to each other. ...
متن کاملContent Based Audio Retrieval Based on Hidden Markov Models Speech and Audio Processing and Recognition Final Project
This project consists in the implementation of a system that retrieves the five most similar audio files from an audio database when an audio file is presented as the input. I concentrated on indoor and outdoor environmental audio files. Audio is a very important kind of media that includes speech, music and various kinds of environmental noise. With the recent public access to different audio ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008